Rank in Wordlist | Frequency | Word |
---|---|---|
4615 | 4 | 1,5 |
5903 | 3 | 4,5 |
7969 | 2 | 0,6 |
7980 | 2 | 1,1 |
7981 | 2 | 1,88 |
8056 | 2 | 2,3 |
12823 | 1 | $14,95/an |
12829 | 1 | 0,1 |
12830 | 1 | 0,15 |
12831 | 1 | 0,2 |
Rank in Wordlist | Frequency | Word |
---|---|---|
9231 | 2 | Un(e |
12576 | 2 | transformé(s |
13810 | 1 | AFR(anius |
15612 | 1 | E(X |
15710 | 1 | Emmenthal(BE |
17575 | 1 | M(arcus |
17952 | 1 | Montagne(s |
18411 | 1 | PROF(essus |
19252 | 1 | Résultat(s |
19796 | 1 | Suisse(2203)Médias(104)Kessava |
Rank in Wordlist | Frequency | Word |
---|---|---|
12824 | 1 | %) |
12826 | 1 | 0)33 |
12827 | 1 | 0)4 |
12828 | 1 | 0)44 |
13221 | 1 | 2)et |
13263 | 1 | 2011)▼ |
14219 | 1 | B.Leutenegger)avec |
16243 | 1 | Glâne)» |
19796 | 1 | Suisse(2203)Médias(104)Kessava |
21002 | 1 | ami(e)s |
Rank in Wordlist | Frequency | Word |
---|---|---|
1987 | 10 | 10% |
2206 | 9 | 50% |
2440 | 8 | 30% |
2786 | 7 | 70% |
3225 | 6 | 100% |
3867 | 5 | %. |
3881 | 5 | 60% |
3882 | 5 | 9% |
4628 | 4 | 4% |
4636 | 4 | 90% |
Rank in Wordlist | Frequency | Word |
---|---|---|
8272 | 2 | Bits&Bites |
13845 | 1 | AT&T |
14217 | 1 | B&B |
14682 | 1 | CONF-WBT&WBTG |
14683 | 1 | CONF-WCA&WCB |
17601 | 1 | METHOD&VISION |
18917 | 1 | R&D |
19267 | 1 | S&P |
Rank in Wordlist | Frequency | Word |
---|---|---|
12823 | 1 | $14,95/an |
13433 | 1 | 30,00$ |
20185 | 1 | US$ |
Rank in Wordlist | Frequency | Word |
---|---|---|
65 | 227 | d'un |
73 | 196 | d'une |
108 | 130 | c'est |
126 | 112 | qu'il |
161 | 96 | C'est |
172 | 90 | n'est |
269 | 58 | d'autres |
332 | 49 | n'a |
336 | 49 | qu'ils |
348 | 48 | s'est |
Rank in Wordlist | Frequency | Word |
---|---|---|
2590 | 8 | et/ou |
5890 | 3 | 2010/2011 |
8063 | 2 | 2009/2010 |
8078 | 2 | 24/24h |
12823 | 1 | $14,95/an |
12850 | 1 | 04/2010 |
12888 | 1 | 1.19/min |
12894 | 1 | 1/2 |
12895 | 1 | 1/66ème |
12904 | 1 | 10/11 |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots